Round 1 - Technical
🔹 Introduce yourself.
🔹 Questions on your current project and past experience (technologies used, project architecture, day-to-day tasks, daily data volume).
🔹 What is ADF? Is it ETL or ELT?
🔹 What is a linked service in ADF?
🔹 What is the difference between a linked service and a dataset?
🔹 What are ARM templates? How do you move a pipeline from the dev to the production environment?
🔹 What is an Integration Runtime? What are the types of IR, and how is each used?
🔹 What are triggers in ADF, and what are their types?
🔹 Explain the storage event and tumbling window triggers.
🔹 What is an activity in ADF? Name 5 activities you have used in your project.
🔹 Explain the Copy, Get Metadata, Lookup, and ForEach activities.
🔹 What is a mount point in Azure Databricks? How do you mount ADLS Gen2 to Databricks?
🔹 What are 3 common roles in Azure?
🔹 Explain the Spark architecture. What are the types of modes in Spark?
🔹 What is a DataFrame in PySpark? DataFrame vs RDD?
🔹 Why is an RDD resilient and fault tolerant?
🔹 Actions vs transformations?
🔹 Which optimization techniques have you used in Spark?
🔹 reduceByKey() vs groupByKey()?
🔹 Difference between persist() and cache()?
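For the reduceByKey() vs groupByKey() question, the usual answer is that reduceByKey() combines values inside each partition before the shuffle, while groupByKey() sends every record across the network. The sketch below is a plain-Python simulation of that idea, not Spark code; the `partitions` data and both function names are hypothetical and exist only for illustration.

```python
from collections import defaultdict

# Hypothetical input: two "partitions" of (key, value) pairs.
partitions = [
    [("a", 1), ("b", 1), ("a", 1)],
    [("a", 1), ("b", 1), ("b", 1)],
]

def group_by_key_style(parts):
    """Shuffle every record as-is, then sum per key on the reduce side."""
    shuffled = [pair for part in parts for pair in part]
    totals = defaultdict(int)
    for key, value in shuffled:
        totals[key] += value
    return dict(totals), len(shuffled)

def reduce_by_key_style(parts):
    """Pre-aggregate inside each partition, then shuffle only partial sums."""
    shuffled = []
    for part in parts:
        local = defaultdict(int)
        for key, value in part:
            local[key] += value
        shuffled.extend(local.items())
    totals = defaultdict(int)
    for key, value in shuffled:
        totals[key] += value
    return dict(totals), len(shuffled)

grouped_totals, grouped_shuffled = group_by_key_style(partitions)
reduced_totals, reduced_shuffled = reduce_by_key_style(partitions)
print(grouped_totals, grouped_shuffled)
print(reduced_totals, reduced_shuffled)
```

Both approaches produce the same per-key totals, but the pre-aggregating version moves fewer records in the simulated shuffle. That is the point interviewers typically look for.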
Round 2 - Managerial
🔹 Introduce yourself.
🔹 Project architecture and team size.
🔹 What is COALESCE() in SQL?
🔹 What are window functions? row_number() vs rank() vs dense_rank()?
🔹 Snowflake schema vs star schema
🔹 One medium-difficulty SQL question
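The COALESCE() and ranking questions above can be tried out with a small sketch. It uses Python's built-in sqlite3 module and assumes the bundled SQLite is version 3.25 or newer (needed for window functions); the `scores` table and its rows are made up for illustration.

```python
import sqlite3

# Hypothetical table: two tied scores and one NULL score.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE scores (name TEXT, score INTEGER)")
conn.executemany(
    "INSERT INTO scores VALUES (?, ?)",
    [("A", 90), ("B", 90), ("C", 80), ("D", None)],
)

rows = conn.execute(
    """
    SELECT name,
           COALESCE(score, 0) AS score,  -- NULL is replaced by 0
           ROW_NUMBER() OVER (ORDER BY COALESCE(score, 0) DESC, name) AS row_num,
           RANK()       OVER (ORDER BY COALESCE(score, 0) DESC) AS rnk,
           DENSE_RANK() OVER (ORDER BY COALESCE(score, 0) DESC) AS dense_rnk
    FROM scores
    ORDER BY row_num
    """
).fetchall()

for row in rows:
    print(row)
```

With the tie between A and B, ROW_NUMBER() still numbers the rows 1 to 4, RANK() gives 1, 1, 3, 4 (leaving a gap after the tie), and DENSE_RANK() gives 1, 1, 2, 3 (no gap).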
Round 3 - HR Discussion
🔹 Salary negotiation